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Although the sign languages in use today are full human languages, certain of the 
features they share with gestures have been suggested to provide information about 
possible origins of human language. These features include sharing common articulators 
with gestures, and exhibiting substantial iconicity in comparison to spoken languages. If 
human proto-language was gestural, the question remains of how a highly iconic manual 
communication system might have been transformed into a primarily vocal communication 
system in which the links between symbol and referent are for the most part arbitrary. The 
hypothesis presented here focuses on a class of signs which exhibit: "echo phonology," a 
repertoire of mouth actions which are characterized by "echoing" on the mouth certain of 
the articulatory actions of the hands. The basic features of echo phonology are introduced, 
and discussed in relation to various types of data. Echo phonology provides naturalistic 
examples of a possible mechanism accounting for part of the evolution of language, with 
evidence both of the transfer of manual actions to oral ones and the conversion of units 
of an iconic manual communication system into a largely arbitrary vocal communication 
system. 

Keywords: sign language, echo phonology, language origins, neuroscience of sign language, mouth gestures 



INTRODUCTION 

In the past 50 years, the study of how human language evolved 
(evolutionary linguistics) has again become a prominent feature 
of linguistic discourse. A complete theory of language evolution 
is beyond the scope of this paper, including as it must, considera- 
tion of brain function and anatomical changes in the vocal tract. 
We are concerned here with only one part of the process — the 
previously hypothesized shift from a primarily gestural or vocal- 
gestural communication system to spoken language (see section 
Historical Perspectives below) and how such a shift could have 
provided a mechanism for converting iconic manual symbols into 
arbitrary vocal symbols. Data from the sign languages of Deaf 1 
communities will provide an insight into this mechanism. 

Since home signing (gesture systems) can appear in the 
absence of linguistic input (Goldin-Meadow, 2003), sign lan- 
guages used by Deaf communities have sometimes been regarded 
as primitive in comparison to spoken languages, and as represent- 
ing earlier forms of human communication. However, linguistic 
research over the past 40 years has demonstrated that sign lan- 
guages are in fact full natural languages with complex grammars 
(Stokoe, 1960; Klima and Bellugi, 1979; Sutton-Spence and Woll, 
1999). The creators and users of all known sign languages are 
humans with "language-ready brains." Nevertheless, it is possible 
that sign share features in common with evolutionary precursors 
of spoken language. 



^'Deaf" with an upper-case "D" is used to refer to membership of a sign 
language-using community and includes both hearing and (audiologically) 
deaf individuals. 



These features include sharing common articulators with non- 
linguistic communication (i.e., gestures), and exhibiting substan- 
tial iconicity in comparison to spoken languages. This iconicity is 
present in signs representing abstract concepts as well as in those 
that represent concrete objects and actions. The form of many 
signs [examples from British Sign Language (BSL)] depict part or 
all of a referent or an action associated with a referent, such as EAT, 
PAINT (holding and using a paintbrush), CAT (whiskers), BIRD 
(beak). Signs referring to cognitive activities (THINK, UNDER- 
STAND, KNOW, LEARN, etc.) are generally located at the forehead, 
while signs relating to emotional activities (FEEL, INTERESTED, 
EXCITED, ANGRY) are located on the chest and abdomen; signs 
with the index and middle fingers of the hand extended and sep- 
arated ("V" handshape) relate to concepts of "two-ness": TWO, 
BOTH, TWO-OF-US, WALK (legs), LOOK, READ (eyes). The per- 
vasiveness of iconicity (even where heavily conventionalized) is 
striking, in both sign languages and gestures. 

If human proto-language was gestural or vocal-gestural, the 
question remains as to how such a communication system with a 
high degree of iconicity might link to the development of artic- 
ulated words in spoken language, in which the links between 
symbol and referent are, for the most part, seen as arbitrary. 
Posing the question in this way, and regarding sign languages 
as "manual" ignores the rich and complex role played by other 
articulators: body, face, and, in particular, the mouth. 

As well as the actions performed by the hands, sign languages 
also make use of mouth actions of various types. The theory 
proposed here relates to one subgroup of mouth actions: "echo 
phonology" (Woll and Sieratzki, 1998; Woll, 2001). These are a 
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set of mouth actions unrelated to spoken language, and which 
occur obligatorily in a number of sign languages alongside certain 
manual signs. They are characterized by "echoing" on the mouth 
certain of the articulatory activities of the hands. 

Three data sources are discussed here: narratives in 3 differ- 
ent European sign languages, anecdotal observations of hearing 
individuals bilingual in BSL and English, and functional imag- 
ing studies with deaf signers. These provide evidence of a possible 
mechanism in the evolution of spoken language by which iconic 
symbols in a manual communication system could have con- 
verted into a vocal communication system with arbitrary links 
between symbol and referent. 

HISTORICAL PERSPECTIVES 

Many writers have suggested that human vocal language may 
have evolved from manual gestures. What is required to sustain 
such a claim is a plausible mechanism by which primarily man- 
ual actions could have transformed themselves into vocal actions. 
One mechanism (not even requiring communicative gesturing as 
an intermediate stage) was suggested by Darwin in The Expression 
of Emotions in Man and Animals (1872): 

"there are other actions [of the mouth] which are commonly 
performed under certain circumstances. . . and which seem to be 
due to imitation or some sort of sympathy. Thus, persons cut- 
ting anything may be seen to move their jaws simultaneously with 
the blades of the scissors. Children learning to write often twist 
about their tongues as their fingers move, in a ridiculous fashion." 
(Darwin, 1872, p. 34) 

Henry Sweet (1888) extended this notion to encompass a transi- 
tion from manual gesture to "lingual gesture": 

"Gesture., helped to develop the power of forming sounds 
while at the same time helping to lay the foundation of lan- 
guage proper. When men first expressed the idea of "teeth," "eat," 
"bite," it was by pointing to their teeth. If the interlocutor's back 
was turned, a cry for attention was necessary which would nat- 
urally assume the form of the clearest and most open vowel. A 
sympathetic lingual gesture would then accompany the hand ges- 
ture which later would be dropped as superfluous so that ADA 
or more emphatically ATA would mean "teeth" or "tooth" and 
"bite" or "eat," these different meanings being only gradually 
differentiated." (Sweet, 1888, pp. 50-52) 

To Sweet, therefore, should go the credit for hypothesizing that 
a "lingual gesture accompanying a natural hand gesture" could 
be a key link between gesture and spoken language. However, he 
provides no evidence for such a process, failing to explain more 
general features of what he calls sympathetic lingual gestures. 

Richard Paget (1930) attempted to find evidence for such a 
theory. Like Sweet, Paget claimed that the earliest human language 
was a language of gestures, in which manual actions were uncon- 
sciously copied by movements -of the mouth, tongue, or lips. 

"Originally man expressed his ideas by gesture, but as he ges- 
ticulated with his hands, his tongue, lips and jaw unconsciously 
followed suit. . . The consequence was that when, owing to pressure 
of other business, the principal actors (the hands) retired from 



the stage. . . their understudies — the tongue, lips, and jaw — were 
already proficient in the pantomimic art." (Paget, 1930, p. 133) 

He supplies a number of examples of this process: 

"Another . . . example may be given, namely, in connection with 
the beckoning gesture — commonly made by extending the hand, 
palm up, drawing it inwards toward the face and at the same time 
bending the fingers inwards toward the palm. This gesture may 
be imitated with the tongue, by protruding, withdrawing, and 
bending up its tip as it re-enters the mouth. 

If this "gesture" be blown or voiced, we get a resultant whis- 
pered or phonated word, like eda, eda, or edra . . . suggestive of 
. . . our English word "hither"." (Paget, 1930, p. 138) 

Paget's theory (known as the "ta-ta" theory from another exam- 
ple suggesting parallels between waving goodbye and flapping the 
tongue) was developed further by Swadesh (1971). He provides 
another example of its application: 

"... a word like the Latin capio, I take, or English capture, whose 
root begins with a k sound and ends in the sound p, made by clos- 
ing the lips. It has been suggested that the formation of the k sound 
at the back of the mouth, while the lips are open, is comparable 
to the open hand. The closing of the lips, then, is analogous to 
the fingers closing with the thumb as one takes hold of an object. 
Thus the pronunciation of the root capio is like the action of tak- 
ing. Of course not all words are to be explained in this way; in fact, 
only a few. And yet the possibility that some words developed in 
this way is not denied by other qualities also evident in language." 
(Swadesh, 1971, p. 4) 

Paget's theory can only be validated if there is evidence for a 
historical process by which manual gestures were reflected in 
movements of the lips and tongue, which were in turn associ- 
ated with the production of speech-sounds. One weakness of the 
approach of Paget and the others is that they all suggest that 
the mouth actions share underlying imagery with the associated 
iconically- motivated manual gesture, leaving open the question of 
how a hypothesized highly iconic manual communication system 
could have subsequently led to spoken language, with its generally 
arbitrary links between symbol and referent. 

Hewes (1973) serves as a point of connection between the 
writers of the late nineteenth and early twentieth century and con- 
temporary writings on language evolution. Kendon (2010) in a 
review of Fitch (2010) summarizes Hewes' view that primate ges- 
tures served as a better point of comparison with human language 
than their vocalizations. Hewes did recognize, however that a 
challenge for a gestural origin of human language was the need to 
account for the switch from manual to vocal communication. His 
suggested reasons included the greater convenience of speaking 
(it could be used in the dark and while the hands were occupied), 
and an increase in vocabulary and ease of lexical retrieval. He also 
supported Paget's (1930) hypothesis, discussed above. 

Recent studies (Erhard et al., 1996; Rizzolatti and Arbib, 1998; 
Rizzolatti and Craighero, 2004) provide such evidence of links 
between brain areas associated with language and areas control- 
ling movement of the hands and arms (also see below). However, 
such findings have not been used to suggest a mechanism in 
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language evolution for the twin shifts from hand to mouth and 
from iconic to arbitrary symbols. 

CONTEMPORARY EVIDENCE 
NEUROBIOLOGICAL PERSPECTIVES 

Studies of neurons in the monkey brain by Rizzolatti and 
colleagues since 1996 (Rizzolatti et al, 1996; Rizzolatti and 
Craighero, 2004) have identified "mirror neurons," which fire 
when a primate observes another individual (monkeys and 
humans) making specific reaching and grasping movements. The 
mirror system, in temporal, parietal, and frontal regions, is part of 
a system specialized for perceiving and understanding biological 
motion. Although research has not shown a mapping of vocaliza- 
tion production onto perception of vocalizations, this mapping 
is implicit in Liberman and Mattingly's (1985) motor theory of 
speech perception, which proposes that speech is understood in 
terms of its articulation, rather than its perception. It should also 
be noted that the anatomical proximity of neurons in the pre- 
motor cortex relating to hand and mouth functions may relate 
evolutionarily to the involvement of both in activities such as 
eating. The relationships between mouth actions related to eat- 
ing, and mouth actions found in spoken language, have been 
discussed in detail by MacNeilage (1998). Meguerditchian and 
Vauclair (2008), describe shared features in the co-occurrence of 
manual and vocal gestures in non-human primates. 

In a series of studies, Gentilucci and colleagues have shown 
that mouth actions are related to manual actions. When partici- 
pants were asked to grasp objects of different sizes while articulat- 
ing syllables such as /ba/ there was a parallel increase in the mouth 
opening and voice spectra of syllables pronounced simultane- 
ously. Semantically congruent words and gestures also show inter- 
action effects not seen in incongruent pairings (Gentilucci, 2003; 
Gentilucci and Corballis, 2006; Barbieri et al, 2009). Bernardis 
and Gentilucci (2006), describing the relationship of words and 
emblems in processing and execution, hypothesize that a sys- 
tem relating actions to syllables might have evolved into a system 
relating symbolic gestures to words, and importantly, draw on 
neurological evidence about the role of Broca's area in both 
gesture and language. 

GESTURE AND SPEECH 

A number of theorists have postulated that manual gesture (on its 
own, without consideration of vocalization or mouth gesture) is 
the origin of language. Rizzolatti and Arbib (1998) align with the 
earlier nineteenth and twentieth century writers, seeing gesture as 
fading once speech has emerged: 

"Manual gestures progressively lost their importance, whereas, 
by contrast, vocalization acquired autonomy, until the relation 
between gestural and vocal communication inverted and ges- 
ture became purely an accessory factor to sound communication" 
(Rizzolatti and Arbib, 1998, p. 193). 

In such models, gesture is seen as unintegrated with speech — both 
in modern human communication and in human evolution. 

McNeill et al. (2008) provides a strong set of arguments against 
this position. They argue that a unimodal communication system, 



using gesture or sign alone, could not have evolved into mod- 
ern human communication, which is primarily bimodal (gesture 
and speech). They suggest that if such a phase existed, it was not 
a proto-language, but a precursor of mimicry and pantomime. 
They argue that a "gesture-first" theory: 

"incorrectly predicts that speech would have supplanted gesture, 
and fails to predict that speech and gesture became a single sys- 
tem. It is thus a hypothesis about the origin of language that 
almost uniquely meets Popper's requirement of falsifiability — and 
is falsified, doubly so in fact. (McNeill et al., 2008, p. 12)" 

Another thread in the "supplantation of gesture by speech" argu- 
ment relates to the advantages of speech over gesture (Corballis, 
2003). McNeill et al. (2008) have argued that speech is the default 
form of human communication because it has fewer dimensions, 
is more linear, is non-imagistic (and hence more arbitrary, with 
the potential for a larger lexicon), etc. Given this asymmetry, 
McNeill and colleagues argue that even though speech and ges- 
ture are selected jointly, it would still be the case that speech is the 
medium of linguistic segmentation: 

"Sign languages — their existence as full linguistic systems — 
impresses many as a reason for gesture-first, but in fact, histori- 
cally and over the world, manual languages are found only when 
speech is unavailable; the discrete semiotic then transferring to the 
hands. As we shall see later, this transfer takes place automatically. 
So it is not that gesture is incapable of carrying a linguistic semi- 
otic, it is that speech (to visually disposed creatures) does not carry 
the imagery semiotic." (McNeill et al., 2008, p. 13) 

HANDS AND MOUTH IN SIGN LANGUAGE 

MOUTH ACTIONS AND OTHER NON-MANUAL ARTICULATORS 

As mentioned above, sign languages of the deaf offer a unique 
perspective on language, since they embody the structural and 
communicative properties of spoken language, while existing 
entirely within a wholly visual-gestural medium. Among other 
insights, they enable investigators to clarify the core components 
of language in distinction to those that reflect input or action 
characteristics of the language system. This difference is reflected 
in the articulators on which languages in the two modalities 
rely. Sign languages use both manual and non-manual articu- 
lators, including the head, face and body (e.g., Liddell, 1978; 
Sutton-Spence and Woll, 1999). Within the face, eye actions 
such as eye narrowing, changes in direction of gaze and eyebrow 
actions (raise/lower) play important roles in SL communication 
(Crasborn, 2006). In addition, although sign languages are not 
historically related to the spoken languages of their surrounding 
hearing communities, sign languages do borrow elements from 
spoken language (Sutton-Spence and Woll, 1999). Thus some 
mouth actions (mouthings) are derived from spoken language, 
while other mouth actions (mouth gestures) are unrelated to 
spoken languages (see Figure 1 below). 

In a study of narratives in three European sign languages 
(Crasborn et al, 2008) mouth actions were found throughout 
(Table 1). There is striking uniformity in the percentage of signs 
accompanied by mouth gestures (35-39%), with greater variation 
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across the three languages in the percentage of signs accompanied 
by mouthings (26-51%). 

Mouthings 

Sign languages can borrow mouth actions from spoken words — 
speech-like actions accompanying manual signs that can disam- 
biguate manually homonymous forms. These are considered to 
be borrowings, rather than contact forms reflecting bilingualism 
in a spoken and signed language, since there is evidence that sign- 
ers can learn these without knowing the source spoken language. 
These serve to disambiguate "manual homonyms": signs with 
similar or identical manual forms. For example, the BSL signs 
ASIAN and BLUE, are manually identical (see Figure 3C below). 
To distinguish which meaning is meant, mouthings are incorpo- 
rated, derived from the mouth actions used when speaking the 
words "Asian" or "blue." 

Adverbials 

Adverbials are arrangements of the mouth which are used to 
signal manner and degree (e.g., to indicate that an action is per- 
formed with difficulty or with ease; to indicate if an object is very 
small or very large, etc.). In Enaction (sometimes called mouth- 
for-mouth), the action performed by the mouth represents that 
action directly (e.g., in CHEW, the mouth performs a "chewing" 
action, while the sign is articulated on the hands). 

ECHO PHONOLOGY 

The term Echo Phonology (Woll and Sieratzki, 1998; Woll, 2001, 
2009) is used for a class of mouth actions that are obligatory 
in the citation forms of lexical signs. In the BSL sign TRUE (see 
Figure 3D below), the upper hand moves downwards to con- 
tact the lower hand, and this action is accompanied by mouth 
closure, synchronized with the hand contact. This category of 
mouth gesture differs from adverbial mouth arrangements as the 
mouth gesture forms part of the citation form of the manual 



Non-manual actions in sign languages 

| (after Woll. 2001 J 

Mouthings Mouth gestures 

Derived from spoken language j [ Not derived from spoken language ] 






In loans f In code blending 1 

{ e.g. to disambiguate J ( 


jAdverhials. enaction, etc. j 


FIGURE 1 | Mouth actions in sign language. 


| ECHO PHONOLOGY J 



sign, and unlike adverbial mouth gestures, do not carry addi- 
tional meaning. Crasborn et al. (2008) refer to this category of 
mouth gestures as "semantically empty." Signs with echo phonol- 
ogy appear incomplete or ill-formed in their citation form if the 
mouth gesture is not present. 

The term "echo phonology" is used, since the mouth action 
is a visual and motoric "echo" of the hand action in a number 
of respects: onset and offset, dynamic characteristics (speed and 
acceleration) and type of movement (e.g., opening or closing of 
the hand, wiggling of the fingers). Echo phonology mouth ges- 
tures are not derived from or related in any other way to mouth 
actions representing spoken words; in the citation form of these 
signs they are an obligatory component, and are presumably con- 
strained by the common motor control mechanisms for hands 
and mouth discussed above. The citation forms of the signs in 
which they are found require the presence of the mouth ges- 
ture to be well-formed, and the mouth gesture always includes 
some movement such as inhalation or exhalation, or a change in 
mouth configuration (opening or closing) during the articulation 
of the sign: for example, BSL signs EXIST (wiggling of fingers, 
no path movement, accompanied by [JJ]); TRUE (active hand 
makes abrupt contact with palm of passive hand, accompanied 
by [am] — see Figure 3D below); DISAPPEAR (spread hands close 
to "flat o" shape, accompanied by [9p] ). 

The essential dependence of the mouth gesture on the articu- 
latory features of the manual movement can be seen in three BSL 
signs all meaning "succeed" or "win." Three different oral pat- 
terns of mouthing co-occur with these signs, and one cannot be 
substituted for the other. In SUCCEED, the thumbs are initially in 
contact, but move apart abruptly as the mouth articulates [pa]. 
In WIN, the hand rotates at the wrist repeatedly as the mouth 
articulates [hy]; and in WON, the hand closes to a flat O, while 
the mouth articulates [Ap]. Most importantly, the action of the 
mouth in signs with echo phonology, while echoing that of the 
hands, is not in itself iconic. 

The parallel movements of the hands and mouth found in echo 
phonology can also be seen in the production of the BSL sign 
DISAPPEAR (Figure 2). Both hands are open and the tongue is 
protruding at the onset of the manual sign. The notation tiers 
show that during the movement of the sign, as the hands close, 
the tongue retracts. 

SYLLABLES OCCURRING IN ECHO PHONOLOGY IN BSL 

The following elements (Table 2) have been identified, although 
it is likely that this is not an exhaustive list. It is not known what 
inventories exist in other sign languages. Some articulatory fea- 
tures are given for them; and since echo phonology is a feature of 
a language used by deaf people, no voiced-voiceless distinction is 



Table 1 | Comparison of hand/mouth actions in three sign languages. 

Language Number of Number of % of signs accompanied Number of % of signs accompanied 

Manual signs mouth gestures by mouth gestures mouthings by mouthings 

British Sign Language 1552 539 35 560 36 

Sign Language of the Netherlands 1162 458 39 299 26 

Swedish Sign Language 1619 624 39 831 51 
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FIGURE 2 | BSL sign disappear showing initial configuration of mouth. 



Table 2 | Echo phonology elements in BSL. 



CONSONANTS— SYLLABLE-INITIAI 



p 


Bilabial stop 


f 


Labio-dental stop 


CONSONANTS-SYLLABLE-FINAL 




P 


Bilabial stop 


m 


Bilabial nasal 


S 


Glottal stop 


VOCALICS 


y 


Front rounded vowel 


a or a 


Low central vowel 


u 


Back rounded vowel 


h 


Pharyngeal fricative 


w 


Bilabial fricative 


8 


Interdental fricative 


J 


Rounded palatal fricative 


BREATH PATTERNS 


< 


Exhalation 


> 


Inhalation 



operative and almost all involve articulations at the front of the 
mouth or lips, where they are most visible. 

The combinations of these elements result in syllables. 
Selected examples of signs using these syllables are given 
(Table 3). 

Although echo phonology is largely voiceless in deaf sign- 
ers, hearing people with deaf parents (bilinguals native in both 
BSL and English) frequently mix sign and speech, either in the 
form of code-mixing (switching between English and BSL) or — 
because these languages occur in different modalities (bimodal 
bilingualism) — by means of code blending, where elements from 
a spoken language appear simultaneously with elements of a sign 
language. 

Anecdotal observations from conversations between hearing 
people with deaf parents (bilinguals native in both BSL and 
English) indicate that echo phonology appears (with or without 
voicing) in the form of code mixing with English in the absence 



Table 3 | Examples of syllables with echo phonology. 

< done 
pa 

Associated with one or two active hands, movement consists of hand 
separation and twisting, with single sharp action 

< < < not-bothered 
fu or fw or fy 

Body contact at start of sign, movement downwards and away from body 

< exist 
J 

Wriggling or fingers, repeated shaking or twisting of wrists(s), no path 
movement 

< < win 
hw or hy 

Repeated twisting of wrist, no path movement 

> thank-god 
Ap 

Closing and twisting of hand(s), sharp movement 

> > true 
am or Am 

Hand closes and contacts passive hand, sharp movement 

> disappear (also see Figure 2) 

Op 

Hand(s) close, sharp movement with abrupt stop 



of production of the manual component. In other words, only the 
oral component is produced. 
Examples include: 

1. A: "Have you done that poster?" 

B: "[ JJJ] (NOT- YET), I'll do it tomorrow" (voiceless) 

2. A: "It was terrible. [Amp]'" (END/absolutely over) (voiced) 

3. A: "I couldn't get a straight answer from anyone. It was 
completely [pipipi] " ( VARIED/inconsistent) (voiced) 

These examples are suggestive of a possible leap from echo 
phonology in signs to a situation where voicing accompanies 
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these mouth gestures so that they begin to have independent exis- 
tence as lexical items. Further research is necessary to explore 
whether these forms are more similar to vocal gestures or to 
words. 

Sweet, Paget and the other early writers cited above postulated 
that iconicity in the mouth gesture itself was the source of spoken 
words. However, it is difficult to see how a mouth gesture on its 
own could ironically express the semantic notion of "succeed" or 
"true." Echo phonology illustrates a mechanism by which abstract 
concepts, which can be represented by iconic manual gestures, can 
be attached to abstract mouth gestures. 

ECHO PHONOLOGY IN DIFFERENT SIGN LANGUAGES 

In a study comparing narratives in three sign languages, the 
occurrence of echo phonology was compared with other types 
of mouth action. The data are drawn from the ECHO (European 
Cultural Heritage Online) corpus. This corpus was created as part 
of a European Union pilot project with the aim of demonstrating 
how scientific data within the humanities (including linguistics) 
can be made widely accessible via the Internet (Crasborn et al., 
2007). 

Data were collected from one male and one female Deaf native 
signer of each of BSL, NGT, and SSL — a total of six signers. After 
reading brief summaries in order to familiarize themselves with 
the content, signers were asked to sign to camera their own ver- 
sions of five of Aesop's fables. Data were then coded with ELAN 
software, using a broadly defined set of transcription categories. 
In all, 5 1 min of signed material were included in this study. All 
annotated data from this study is freely available at the ECHO 
web site: http://www.let.ru.n1/sign-lang/echo.2 

Echo phonology was found in all three sign languages. Of 
mouth gestures found in the narratives (i.e., excluding signs with 
mouthing), signs with echo phonology form 10.8% of mouth ges- 
tures in BSL, 12.6% in Sign Language of the Netherlands, and 
16% in Swedish Sign Language (Crasborn et al., 2008). 

Echo phonology has also been studied in other sign lan- 
guages, including German Sign Language (Pendzich, 2013) and 
American Sign Language (Mather and Malkowski, 2013). Mather 
and Malkowski explored opening and closing movements of the 
mouth in detail, in particular, how mouth closing occurs when the 
hands contact the body, and mouth opening occurs when hand 
contact with the body is broken. 

NEURAL CORRELATES OF ECHO PHONOLOGY 

Despite the differences in the modality of the perceived signal, the 
neural organization of language is remarkably similar in spoken 
and signed language. Neuroimaging studies of native signers show 
similar patterns of lateralization and activation when processing 
spoken or signed language data. Specifically, sign language pro- 
cessing is associated with activation in left temporal and frontal 
cortex, including Broca's area (BA 44/45), just as for spoken lan- 
guage, (see e.g., Emmorey, 2001; Corina et al., 2003; MacSweeney 
et al., 2008; Newman et al., 2010 for a review). MacSweeney et al. 
(2002) also found no differences between BSL and English in 
the extent of lateralization, with both languages left-lateralized. 
Studies of patients with brain lesions following CVA consistently 
indicate that perisylvian regions of the left hemisphere support 



language processing (Atkinson et al., 2005; see Woll, 2012 for a 
review). 

Despite their similarities, the networks for spoken and sign 
language are not completely identical. MacSweeney et al. (2002) 
report that regions which showed more activation for BSL than 
audiovisual English included the middle occipital gyri, bilaterally, 
and the left inferior parietal lobule (BA 40). In contrast, audio- 
visual English sentences elicited greater activation in superior 
temporal regions than BSL sentences (pp. 1589-1590). 

With these considerations in mind, Capek et al. (2008) 
explored the sensitivity of the cortical circuits used for language 
processing to the specific articulators used, not only comparing 
speech and signing but examining activation during perception 
of signs with English mouthing, with echo phonology, and with 
no mouth actions. In their fMRI experiment, lists of lexical 
items were presented to deaf native signers. These comprised: 

(1) silently articulated English words with no hand action (SR); 

(2) BSL signs with no mouth action (hands only — Man); (3) 
BSL signs with mouthings (disambiguating mouth, where the 
mouthing distinguished between two manually identical signs — 
DM); and (4) BSL signs with echo phonology (EP). 

The stimuli were designed to vary on the dimensions of pres- 
ence or absence of mouth opening/ closing; presence or absence 
of hand and arm movements; and presence or absence of English- 
based mouth actions (Table 4). 

Stimuli consisted of single words/signs, examples of which are 
given in Table 5. The list of silently spoken words was based on 
English translations of the signs. 

Figure 3 shows examples of each of the stimulus types: 

Thirteen (6 female; mean age 27.4; age range: 18-49) right 
handed participants participated. Volunteers were congenitally 
deaf native signers, having acquired BSL from their deaf parents. 
Stimuli were presented in alternating blocks of each of the exper- 
imental and a baseline condition. In order to encourage lexical 
processing, participants performed a target-detection task. Full 



Table 4 | Characteristics of stimuli in fMRI experiment. 



Mouth 
opening 
and closing 



Hand- arm English- 
movements derived 
(BSL) mouth 



No mouth (HO) 

Phonology (EP) + 

Disambiguating mouth (DM) + 

Silent speech (SS) + 



+ 
+ 



+ 
+ 



Table 5 | Examples of stimuli in fMRI study (EP syllables in brackets). 



EP 

EXIST [uj] 
WIN [hy] 
NONE [pu] 
SUCCESS [pa] 
END [pam] 



DM 

FINLAND/METAL 

BATTERY/AUNT 

WOOD/PROBLEM 

RUSSIA/BOY 

ITALY/WIN 



Man 

TABLE 

CHERRY 

BUTTER 

KNOW 

FAX 
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details of the experimental protocol and analysis may be found 
inCapek etal. (2008). 

SIGN LANGUAGE (MAN, DM, EP) 

In all three sign language conditions, Deaf native signers acti- 
vated core language regions that are typically found when hearing 
people listen to speech. Although both sign language and speech 
involve perisylvian regions, sign language perception activated 
more posterior and inferior regions (Figure 4). 



A B 




FIGURE 3 | Illustrations of stimuli. (A) SS, Silent articulation of the 
English word "football." The fricative (/f/)("foot.."), and the semi-open vowel 
Iv.l ('.'.ball") are clearly visible. (B) Man, The BSL sign ILL. (C) DM, The BSL 
sign ASIAN shows the mouthing of /el/ and /3/. The face insets show the 
corresponding parts of the mouthings for the manual homonym BLUE, 
where /b/ and /u:/ can be seen. (D) EP The manual sequence for [TRUE] 
requires abrupt movement from an open to a closed contact gesture. As 
this occurs, the mouth closes abruptly. 



COMPARING ECHO PHONOLOGY (EP) AND OTHER MOUTHINGS (DM) 

The task required participants to process material linguistically. 
In order to achieve lexical processing, BSL users must integrate 
perceptual processing of hands and of face/head, and this needs 
to be achieved fluently and automatically. If the cortical circuitry 
for sign language processing were driven by a mechanism that 
is "articulation-blind," we would expect there to be no system- 
atic differential activation between signs with mouthings (where 
the mouth information is non-redundant), signs with no mouth 
action, and signs with echo phonology. Yet the contrasts found 
suggest this is not the case. 

DM generated relatively greater activation in a circum- 
scribed region of the left middle and posterior portions of 
the superior temporal gyrus (resembling the speech reading 
condition), while EP produced relatively greater posterior acti- 
vation (Capek et al, 2008, p. 1231). We can consider the 
four conditions to represent a continuum from speech (SR) 
to speech accompanying signs (DM) to signs with accompa- 
nying non-speech-like mouth actions (EP) to purely manual 
signs (Man). Since greater posterior activation is characteristic of 
more sign-like material, EP also occupies an intermediate posi- 
tion between signs without mouth and signs with mouth actions 
derived from spoken language (Figure 5) in terms of neural 
activity. 

The comparison of mouthings (DM) and echo phonology 
(EP) provides information about the nature of the mouth move- 
ments, and their role in sign language processing. The only 
differences in activation between DM and EP signs were found 
in the temporal lobe, with echo phonology (which is not derived 
from speech) demonstrating relatively greater posterior activation 
in both hemispheres than DM. This can be interpreted as a cor- 
tical correlate of the claim that the hands are indeed "the head 
of the mouth" (Boyes-Braem and Sutton-Spence, 2001), for echo 
phonology, as proposed by Woll (2001). While DM resembles 
speechreading in terms of functional cortical correlates, activation 




FIGURE 4 | Brain activation. (A) Activation during silent speechreading (SS). (B) Activation during processing of signs without any mouth actions (HO). (C) 
Activation during processing of signs with disambiguating mouth actions (DM). (D) Activation during processing of signs with echo phonology (EP). 
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Disambiguating Mouth vs. 
Echo phonology 




FIGURE 5 | Contrast between activation for DM (yellow) and EP signs 
(blue). 



for EP resembles that for manual-only signs. Thus EP appears to 
occupy an intermediate position between spoken words and signs. 

CONCLUSIONS 

One issue for those concerned with suggesting a link between ges- 
ture and word has always been how the arbitrary symbol-referent 
relationship of words in spoken language could have come from 
visually-motivated gestures. Echo phonology provides evidence 
for a possible mechanism. Firstly, the phenomenon appears 
to be fairly common across different sign languages (although 
the occurrence of echo phonology remains to be researched 
in non-European sign languages). Secondly, the mouth actions 
found in echo phonology are themselves non-visually moti- 
vated. For example, signers report that BSL EXIST is iconic 
(Vinson et al., 2008), indicating "something located there," but 
it is impossible to reconstruct from the echo phonology syl- 
lable which accompanies it [J] the meaning "exist," Thirdly, 
the actual inventory of elements in echo phonology looks very 
much like a system of maximal contrasts in a spoken lan- 
guage phonology (although there are some limitations because 
of the absence of sound contrasts). Fourthly, functional imaging 
research on the representation of signs and words in the brain sug- 
gests that echo phonology occupies an interesting intermediate 
position. 

This paper represents a preliminary exploration of echo 
phonology. However, the data lead us to a number of conclu- 
sions. They support the arguments of those who argue against 
the notion that a unimodal manual protolanguage preceded 
the evolution of spoken language, since they demonstrate the 
extent to which signs are combined with mouth actions. The 
data also provide a window onto a mechanism by which the 
arbitrary pairing of a referent with a symbol (Saussure's defin- 
ing feature of spoken language) could have occurred. Further 
research is needed to explore the presence of echo phonology 
in other sign languages (including those with a more recent 
point of creation than BSL) and whether echo phonology is 
subject to change (for example, added or transformed in a pro- 
cess of sign conventionalization). These studies may provide 
more insights into the origins of phonological/lexical structure 
in spoken language, and from that to the evolution of human 
language. 
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